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METHOD OF PRODUCING ANTISENSE OLIGONUCLEOTIDE 
FIELD OF THE INVENTION 

The present invention relates to a method of producing 
antisense oligonucleotide. 
BACKGROUND OF THE INVENTION 

The antisense oligonucleotide method is a method of 
inhibiting the expression of a protein by hybridizing, to a 
target gene, oligonucleotide (antisense oligonucleotide) having 
a sequence complementary or substantially complementary to the 
target gene, and this method is used in e.g. the preparation of 
medical drugs or genetic recombinant plants. 

In the antisense oligonucleotide method, it is first 
necessary to prepare oligonucleotide with a nucleotide sequence 
having an inhibitory effect (antisense oligonucleotide effect or 
antisense effect) on expression of a gene by hybridization to 
the gene. Preparation of oligonucleotide with such a nucleotide 
sequence includes a method of synthesizing oligonucleotide 
complementary to experimentally found antisense-ef fective region 
and a method of synthesizing oligonucleotide complementary to 
predicted antisense-ef fective region without experiment. 

In the former method, however, it is necessary to prepare 
oligonucleotides whose sequences are complementary to a target 
gene and to experimentally determine whether they show antisense 
effect on the target gene, which results in a long period of 
time, higher costs and complicated procedures. 

In the latter method, an initiation site for translation, 
its upstream non-coding region, or other region are selected 
empirically as a target site and then antisense oligonucleotides 
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to the target site are prepared. Concerning the latter method, 
various methods are proposed (K. R. Blake et al., Biochemistry, 
24, 6132-6138 (1985); E. Uhlmann, A. Peyman, Chemical Reviews, 
90, 543-584 (1990)), but their reliabilities are low and the 
antisense oligonucleotides selected in these methods do not 
always effectively inhibit the expression of the target protein 
(e.g. R. D. Ricker, A. Kaji, FEBS Letters, 309, 363-370 (1992). 
There are also proposals for predicting a target site of 
antisense oligonucleotide based on calculation of energy for 
formation of secondary structure of mRNA or its precursor, or 
energy for formation of a hybrid between mRNA or its precursor 
and antisense oligonucleotide, or the difference between these 
two energies. However, antisense oligonucleotides selected in 
these methods are low in reliability and are not always 
effective (R. A. Stull et al., oligonucleotide Research, 20, 
3501-3508 (1992)). 

In the antisense oligonucleotide method, therefore, there 
is demand for highly reliable prediction of an effective 
antisense oligonucleotide, that is, antisense oligonucleotide 
having a nucleotide sequence to effectively inhibit expression 
of mRNA (or its precursor) coding for a target protein, in order 
to facilitate the preparation of antisense oligonucleotide. 

SUMMARY OF THE INVENTION 

The object of the present invention is to provide a method 
of producing antisense oligonucleotide in which the antisense 
oligonucleotide can be obtained efficiently without conducting 
any experiment. 

As a result of eager research, the present inventor found 



2 



that antisense oligonucleotide can be prepared as the following. 
A possibility of forming a substantially complementary duplex 
between a specific region in mRNA and every other region in the 
same mRNA is calculated and oligonucleotide having a 
substantially complementary sequence to the specific region with 
a low value for the sum of the duplex- forming possibility is 
prepared . 

That is, the present invention is a method of producing 
antisense oligonucleotide, in which the possibility of forming a 
substantially complementary double-stranded chain between each 
region of a nucleotide sequence in mRNA and a region other than 
said region is expressed as a numerical value, and 
oligonucleotide substantially complementary to a region with a 
smaller numerical value is prepared as antisense oligonucleotide. 

The above possibility is numerically expressed on the 
basis of distance between two nucleotide sequence regions 
forming a substantially complementary double-stranded chain, and 
more specifically the possibility is expressed as a lower value 
as the distance increases, and it is expressed as the largest 
numerical value when there are 3 to 10 bases, preferably 4 to 6 
bases between the two nucleotide sequence regions. Further, the 
possibility is expressed on the basis of the bond energy for 
forming a double-stranded chain, and more specifically it is 
expressed as a larger numerical value when the bond energy is 
higher. The bond energy herein used can be obtained using the 
nearest neighbor model. 

In particular, the above possibility is expressed most 
preferably as a numerical value based on the distance between 
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the two nucleotide sequence regions and the bond energy for 
forming a substantially complementary double-stranded chain. 
BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 shows an outline of a method of calculating the 
possibility of forming a double-stranded chain of mRNA. 

FIG. 2 (A) to (C) show an outline of a method of calculating 
the possibility of forming a double-stranded chain of mRNA. 

FIG. 3 (A) and (B) show an outline of a method of calculating 
the possibility of forming a double-stranded chain of mRNA. 

FIG. 4 shows a flow chart for a method of calculating the 
possibility of forming a double-stranded chain of mRNA. 

FIG. 5 shows results for the calculated possibility of 
VEGF mRNA to form a substantially complementary double-stranded 
chain and results for experimentally-determined expression of 
VEGF. 

FIG 6 shows the expression of VEGF in the presence of 
antisense oligonucleotide in a cell-free transcription and 
translation system. 

In FIGS. 2 and 3, 1 is a loop, 2 is a loop, 3 is a stem, 4 
is a stem, 5 is a loop, and 6 is a loop. 

DETAILED DESCRIPTION OF THE INVENTION 

Hereinafter, the present invention is described in detail. 

mRNA can form a double-stranded chain if a certain 
specific region is substantially complementary to another region 
in the same mRNA. This property is utilized in the present 
invention to prepare antisense oligonucleotide as the following. 
A possibility of forming a substantially complementary chain 
between a specific region in mRNA and every other region in the 



same mRNA is evaluated numerically and then an antisense 
oligonucleotide substantially complementary to a region assigned 
a summed low numerical value as this possibility is prepared, 
assuming that a region having a summed high numerical value is 
readily forming a substantially complementary chain while a 
region having a substantially complementary sequence to a summed 
low numerical value is hardly forming a substantially 
complementary chain, i.e. the region is remaining in a single- 
stranded chain. 

In the present invention, the possibility of forming a 
substantially complementary chain (also referred to hereinafter 
as "the ability to form a substantially complementary chain") is 
numerically expressed. The term "complementary" refers to the 
state in which a specific region in mRNA and another specific 
region in the same RNA hybridize to each other (i.e. forming a 
base pair), and for example, it refers to the relationship 
between adenine (A) and uracil (U) or the relationship between 
cytosine (C) and guanine (G) . In the present invention, the 
term "complementarity" means that a specific region and another 
region in the same mRNA hybridize to each other with 
antiparallel direction, that is, a certain region in the 5 f -*3' 
direction of the sequence hybridizes to another region in the 3 ' 
-*5 f direction of the sequence. 

Further, with the term "substantially" given, the base 
pair in the present invention is not limited to the base pair of 
G and C or the base pair of A and U, and it includes any other 
base pair if these bases hybridize to each other. Therefore, 
the base pair can include not only the base pair of G and C or A 
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and U but also the base pair of G and U. Furthermore, if the 
substantially complementary chain is sufficiently long (e.g. 10 
bases or more), a few mismatch base pairs may occur. 

Hereinafter , the specific means of numerically expressing 
the ability to form a substantially complementary double- 
stranded chain is described. 

First, whether a region beginning at a base at a specific 
site and consisting of continuous 1 or more bases, preferably 2 
to 4 or more bases, in a specific direction (a predetermined 
direction of S'-*^' or 3 , -*5 l ) in the mRNA (or its precursor), 
is substantially complementary to a region apart by 4 bases or 
more (preferably 5 bases or more) from said region is determined 
by examining the whole nucleotide sequence of the RNA. If they 
are substantially complementary, it is determined whether their 
respective next bases are substantially complementary to each 
other, and if they are still substantially complementary, their 
further next bases are examined for substantial complementarity. 
In this manner, a possible maximum substantially complementary 
region is identified. This region containing the specific site 
is a specific region in the mRNA. Then, the possibility that 
this specific region and its corresponding substantially 
complementary region form a substantially complementary double- 
stranded chain is numerically expressed. The computer program 
for this is set such that insofar as they are apart from each 
other by a predetermined distance, a larger numerical value is 
given for a shorter distance between the substantially 
complementary regions and a smaller numerical value is given for 
a longer distance. Also, the calculation program is set such 
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that a larger numerical value is given when the energy for 
forming a double-stranded chain is larger and a smaller 
numerical value is given when the energy is smaller. The value 
thus numerically expressed is assigned to each base in said 
specific region to form said double-stranded chain (and also to 
each base in said substantially complementary region to form a 
double-stranded chain) . 

Then, a next other region substantially complementary to 
said region beginning at the specific site is identified in the 
same manner as described above, and the ability of the 
identified complementary region and said specific region to form 
a substantially complementary double-stranded chain is 
numerically expressed in the same manner. Then, this numerical 
value is added to the previously determined numerical value 
assigned to each base. In this manner, a region beginning at 
the specific site in mRNA is examined for its substantial 
complementarity to every other region in the same mRNA, and if 
they are complementary, their ability to form a substantially 
complementary double-stranded chain is numerically expressed in 
the same manner as above. This numerical value is assigned to 
the corresponding base (if there is a previous value assigned to 
the base, it is assigned to the base by adding it to the 
previous value). If the computer program is set such that this 
numerical value is assigned to both the specific region and its 
substantially complementary region, the value is also assigned 
to the latter region in the same manner. 

Every possible region in the mRNA is examined in the same 
manner, and the ability of each region to form a substantially 
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complementary double-stranded chain is determined as the sum of 
the numerical values thus obtained for each base. To determine 
the sum of the numerical values for each base, the numerical 
value obtained between certain bases and their corresponding 
complementary bases within a substantially complementary region 
shall not be added twice or more times. For example, let us 
suppose a region of 2 bases or more beginning at the 10-position 
towards the 3 1 -terminal (where the 5'- terminal is given the 1- 
position) is examined as a specific region for its substantial 
complementarity to 2 bases or more beginning at the 30-position 
towards the 5* -terminal. If it is assumed that 4 bases 
beginning at the 9 -position towards the 3* -terminal are found to 
be substantially complementary to 4 bases beginning at the 31- 
position towards the 5' -terminal, then a numerical value 
obtained between 3 bases beginning at the 10-position towards 
the 3' -terminal and 3 bases beginning at the 30-position towards 
the 5 '-terminal or a numerical value obtained between 2 bases 
beginning at the 11-position towards the 3' -terminal and 2 bases 
beginning at the 29-position towards the 5' -terminal are not 
counted as the ability to form a substantially complementary 
double-stranded chain. 

If a certain specific region is substantially 
complementary to another region, the possibility of forming a 
double-stranded chain by these 2 regions is expressed with a 
significant value, while if a certain specific region is not 
substantially complementary to another region, the possibility 
of forming a double-stranded chain by these 2 regions is 
expressed as zero (0). For this purpose, we can use a computer 
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programme to calculate the possibility. 

The type of mRNA to which the antisense oligonucleotide is 
prepared according to the present method, is not particularly 
limited. Examples of such mRNA are mRNA for vascular 
endothelial growth factor derived from animals such as humans or 
rats, mRNA for heme oxygenase I type or II type derived from 
animals such as humans or rats, and mRNA for lucif erase derived 
from firefly. 

The method of numerically expressing the possibility of 
forming a double-stranded chain by a part of the sequence of 
mRNA and a different part in the mRNA is outlined in FIG. 1. 

To examine a specific region in mRNA for complementarity, 
the length of its fragment is preferably 2 to 4 base pairs or 
more. FIG. 1 shows the case where the whole length of mRNA is 
50 bases and the length of a specific region is 2 bases or more. 
In FIG. 1, it is determined whether 2 bases (bases at the 1- and 
2 -positions, referred to as a x ) at the 5' -terminal in mRNA and 2 
bases (bases at the 50- and 49-positions , referred to as b x ) at 
the 3 1 -terminal in the same mRNA are substantially complementary 
(A x ). If a x and b x are not substantially complementary, then it 
is determined whether a x is complementary to 2 bases (bases at 
the 49- and 48-positions , referred to as b 2 ) shifted by 1 base 
to the 5' -terminal from b 1 (FIG. 1, A 2 ). This procedure is 
repeatedly carried out to determine whether there is a sequence 
complementary to a x in the whole region of mRNA except for 
region required for minimum distance (3 bases between the two 
chains) (FIG. 1, A x to A 44 ). If a 1 is found to be complementary 
to 2 bases (the 46- and 45-positions, referred to as b 5 ) apart 
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by 4 bases from the 3 1 -terminal (FIG 1, A 5 ), then their next 
bases, i.e. a base at the 3 -position and a base at the 44- 
position, are examined for substantial complementarity. If said 
2 bases are found to be substantially complementary to each 
other, their next bases, i.e. a base at the 4 -position and a 
base at the 43-position are further examined for substantial 
complementarity (see and " in A 5 , FIG. 1). The same 

procedure is repeatedly carried out insofar as substantially 
complementary bases continue (provided that a minimum number of 
bases required for forming a loop between the substantially 
complementary bases are secured), and the ability of the 
complementary nucleotide sequences to form a double-stranded 
/ --s chain is numerically expressed. This numerical value is given 
to each base in sl x forming a complementary nucleotide sequence 
beginning at the 1-position (and to each base in b 5 which forms 
a substantially complementary double-stranded chain with a 2 ). 
Bond energy can be further used as a numerical value for 
reflecting the ability of said complementary base sequences to 
form a double-stranded chain, as described below: 

It can be assumed here that a higher ability of forming a 
substantially complementary double-stranded chain between a 
specific site and its corresponding substantially complementary 
site leads to formation of a more stable substantially 
complementary chain. 

The ability to form a substantially complementary double- 
stranded chain depends on experimentally-determined bond energy 
(Gibbs free energy) for formation of a substantially 
complementary double-stranded chain or calculated bond energy 
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based on the nearest neighbor model. Thus, the ability to form 
a substantially complementary double-stranded chain is 
numerically expressed by using such values. 

For example, the ability to form a substantially 
complementary double-stranded chain can be numerically expressed 
by using the "nearest neighbor model " (Naoki Sugimoto, 
"Seibutsubutsuri" (Biophysics), Vol. 33, pp. 1-7, (1993)) in the 
manner as described below. The "nearest neighbor model" is a 
model made on the ground that the formation of a base pair is 
most influenced by its next and previously formed base-pair. 
Bond energy A G generated between a dimer (i.e. consisting of a 
certain base and its next base) and its substantially 
complementary sequence is expressed in Table 1 below. 

Table 1 



Sequence A G 

(sense chain/antisense chain) (kcal/mol) 
5 .— 3 . 3 .<_ 5 . 



AA/UU 


-0.7 


UU/AA 


-0.7 


AU/UA 


-0.8 


CG/GC 


-1.9 


CU/GA 


-1.6 


AG/UC 


-1.6 


GA/CU 


-2.1 


UC/AG 


-2.1 


GC/CG 


-3.2 


GG/CC 


-2.8 


CC/GG 


-2.8 


GU/CA 


-2.0 


AC/UG 


-2.0 


UA/AU 


-0.9 



UG/AC 


-1.8 


CA/GU 


-1.8 


AG/UU 


-0.4 


UU/GA 


-0.4 


CG/GU 


-1.3 


UG/GC 


-1.3 


GG/CU 


-1.2 


UC/GG 


-1.2 


UG/AU 


-0.5 


UA/GU 


-0.5 


GG/UU 


-0.4 


UG/GU 


-0.5 


AU/UG 


-0.5 


GU/UA 


-0.5 


CU/GG 


-1.3 


GG/UC 


-1.3 


GU/CG 


-1.7 


GC/UG 


-1.7 


UU/AG 


-0.4 


GA/UU 


-0.4 


GU/UG 


-0.4 


UU/GG 


-0.4 



The ability to form a substantially complementary double- 
stranded chain can be calculated using both the bond energy of 
corresponding base pairs in Table 1 and the distance between 2 
strands in the double-stranded chain in the following equation: 
The ability to form a substantially complementary double- 
stranded chain = ( (L + l)/r) F -exp ( | A G | / RT ) (I) 
where | A G | is absolute value of bond energy , R is gas constant, 
T is absolute temperature, L is minimum number of bases in a 
loop, r is distance between one of double strands and another 
strand (number of bases between 2 chains (regions) +1), and F 



is evaluation index for distance r. 

L is in the range of 3 to 10, preferably 4 to 6. r is an 
integer of 1 or more. F is zero or a positive number which can 
be assigned e.g. 6, 1/3, 0.1, etc. 

For example, if a specific site is located at the 1- 
position and there is the sequence "GUAU" at the 43- to 46- 
positions substantially complementary to the nucleotide sequence 
"AUGC" at the 1- to 4-positions, then the bond energy AG for 
their formation of a double-stranded chain can be calculated on 
the basis of Table 1, as follows: 
AG = A(AU/UA) + A(UG/AU) + A (GC/UG) 

= (-0.8) + (-0.5) + (-1.7) 

= -3.0 

Also, r = 43 - 4 = 39. If L = 4, F = 6, and the 
temperature is 37 (T=310.15K), then the ability to form a 
substantially complementary double-stranded chain in this case 
can be numerically expressed as follows: 

The ability to form a substantially complementary double- 
stranded chain (1-4 and 46-43) 

= (5/39)*-exp(3.0 kcal/RT) 

= ( 5/39 ) 6 -exp( 3000/1. 9872X310. 15) 

= 5.77X10~ 4 

The resulting numerical value (5.77X10~ 4 ) is given to the 
respective bases at the 1- to 4-positions and the 43- to 46- 
positions . 

It is thereafter a region complementary (or substantially 
complementary) to a nucleotide sequence of at least 2 bases 
beginning at the 1-position as the specific site is searched, 
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and if present, its ability to form a (substantially) 
complementary double-stranded chain is calculated. For example, 
if the sequence at the 39- to 40-positions is "AU" and the 
sequence at the 15- to 17 -positions is "CAU" in the above 
example, the bases M AU " at the 1- to 2 -positions are 
complementary to the bases "AU" at the 39- to 40-positions and 
the bases "AUG" at the 1- to 3-positions are complementary to 
the bases "CAU" at the 15- to 1 7 -positions . Hence, the ability 
to form a substantially complementary double-stranded chain is 
numerically expressed for each case, and the numerical value for 
the former (double-stranded chain formed with bases at the 39- 
to 40-positions) is added to bases at the 1- to 2-positions (and 
bases at the 39- to 40-positions) and the numerical value for 
the latter (double-stranded chain formed with bases at the 15- 
to 17-positions) is added to bases at the 1- to 3-positions (and 
bases at the 15- to 17-positions). 

This operation is continued until the distance between the 
3' -side base in the specific region (a x ) and the 5' -side base in 
other regions {b lf b 2 , ...) reaches a predetermined number of 
bases (FIG. 1, A 44 ). The minimum value of "r" may be arbitrarily 
set e.g. at 4 to 11, preferably 5 to 7. 

In the above example, minimum "r" was set as 4. Thus, the 
complementarity to the sequence of a 1 is examined for sequences 
of b 1# b 2 ,.... to the sequence at the 7- to 6-positions (FIG. 1, 
) • 

In this way, a specific region beginning at a specific 
base (in this case, the 1 -position) is examined for substantial 
complementarity to continuous 2 or more bases throughout the 
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whole region of mRNA (excluding a region located with a distance 
of less" than "r" between a specific region beginning at a 
specific base (in this case, the 1 -position) and its 
corresponding complementary region), and if there is a 
substantially complementary region, the ability to form a 
substantially complementary chain is numerically expressed and 
this numerical value is assigned to each base in the specific 
region (and each base in its corresponding substantially 
complementary chain) or added to the value of a previous value 
if any. 

Then, whether a sequence (bases at the 2- and 3-positions, 
referred to as a 2 ) apart by 1 base from the 5* -terminal of the 
mRNA is substantially complementary to 2 bases (bases at the 50- 
to 49-positions, referred to as b x ) from the 3 '-terminal of the 
same mRNA is examined (FIG. 1, B x ). In the same manner as above, 
the sequence of a 2 is examined for substantial complementarity 
to the sequence of b x , b 2 , b 3 ... which are apart by 1 base from 
one another. If there is a substantially complementary region, 
substantial complementarity of their next bases is then 
determined as described above. For example, as shown in B 2 , 
whether a base next to a 2 (base at the 4-position from the 5 1 - 
terminal) is substantially complementary to a base at the 47- 
position (base at the 4-position from the 3' -terminal) in b 2 is 
examined. If they are substantially complementary, whether 
their next bases (i.e. base 5 in a 2 and base 46 in b 2 ) are 
substantially complementary and so forth (see arrows in B 2 in 
FIG.l) . 

This procedure is repeatedly carried out, and the 
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possibility of forming a substantially complementary chain is 
numerically expressed for each substantially complementary 
region (i.e. A 5/ B 2 , C x , and C 3 in FIG. 1), and this numerical 
value is assigned to each base in the specific region (and each 
base in its corresponding chain) forming a substantially 
complementary double-stranded chain to finish the procedure (FIG. 
1/ Z). 

The computer program for numerically expressing the 
possibility of forming a substantially complementary double- 
stranded chain is set such that the numerically expressed 
possibility of forming a substantially complementary chain is 
assigned to one strand (region) and/or another strand (region) 
forming a double-stranded chain. In the above example (FIG. 1), 
the numerically expressed possibility of forming a substantially 
complementary chain is assigned to both of the strands forming a 
double-stranded chain. If the numerically expressed possibility 
of forming a substantially complementary chain is assigned to 
either (region) of the strands in FIG. 1, the value may be 
assigned to the regions a x , a 2 , a 3 ,..., or alternatively to the 
regions b lr b 2 , b 3 ,... . In other words, if the value is 
assigned to only the regions a 1# a 2 , a 3 ,..., the value is not 
assigned to the regions b x , b 2 , b 3 , . . . . If the numerically 
expressed possibility of forming a substantially complementary 
chain is assigned to only one (region) of the strands, whether a 
nucleotide sequence at the 2- to 1-positions is complementary to 
a nucleotide sequence at the 49- to 50-positions as the specific 
region is determined, and if they are substantially 
complementary, the possibility of forming a double-stranded 
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chain is numerically expressed and then assigned to bases at the 
49- andf BO-position by adding the value to a previous value if 
any. To evaluate the possibility of forming a substantially 
complementary chain, the computer program is set such that the 
possibility of forming a substantially complementary chain is 
expressed as a smaller value when the "r M value is larger. For 
example, if in FIG. 2(A), m x is substantially complementary to 
m 2 and m 4 , the ability of m 1 and m 2 to form a substantially 
complementary chain is calculated to be higher or not lower than 
that of the ability of m x and m^ because the distance r x between 
m l and m 2 is shorter than the distance r 2 between m x and 1114. If 
m 1 and m 2 form a substantially complementary double-stranded 
chain, a loop of single-stranded chain is formed (loop 1 in FIG. 
2(B)). In this case, the possibility of forming a loop of 
single-stranded chain by linkage between m x and m 2 (loop 1 in 
FIG. 2 (B)) is higher or not lower than the possibility of 
forming a loop of single-stranded chain by linkage between m x 
and 104 (loop 2 in FIG. 2 (C)). 

Assuming in FIG. 3 (A) and (B) that m x and m 2 are 
substantially complementary to m 3 and m* respectively to form 
double-stranded chains (stems 3 and 4) and that the length of 
loop 5 formed by linkage between m 1 and m 3 is equal to the 
length of loop 6 formed by linkage between 114 and 1114, and if the 
bond energy for forming a double-stranded chain between m x and 
m 3 is higher than that between m 2 and m 4 , the computer program is 
set such that the base pair with the higher bond energy is given 
a higher ability to form a substantially complementary double- 
stranded chain. 



In FIG. 3, therefore, the possibility of forming the 
double-stranded chain (stem 3) between m 1 and m 3 is higher than 
the possibility of forming the double-stranded chain (stem 4) 
between m 2 and m 4 . 

Then, the respective values derived from the possibility 
of forming substantially complementary double-stranded chain is 
summed up for each base. In FIG. 1, the summed possibility (PI) 
for the base at the 1 -position to be substantially complementary 
to a base in another region is expressed as PI = p l9 and the 
summed possibility (P2) for the base at the 2-position to be 
substantially complementary to a base in another region is 
expressed as P2 = p x + p 2/ and P3 for the base at the 3-position 
is expressed as P3 = p 2 + p 3 + p 4 , and P4 for the base at the 4- 
position is expressed as P4 = p 3 + p 4 (FIG. 1). Similarly, the 
summed value possibilities (P5, P6, ... P50) are determined. 

A lower value of summed possibility indicates a lower 
possibility of forming a substantially complementary double- 
stranded chain, i.e. possibility of remaining a signal-stranded 
chain is high, and thus a base with a lower value is preferable 
as a target site for antisense oligonucleotide. A region 
consisting of 6 continuous bases or more, preferably 10 bases or 
more, more preferably 15 bases or more having a low value of 
summed possibility is particularly hard to form a substantially 
complementary double-stranded chain and is thus suitable as a 
target site for antisense oligonucleotide. In this manner, it 
is possible to predict an antisense oligonucleotide target site 
and to design antisense oligonucleotide substantially 
complementary to that site. 
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On the basis of this design, it is possible to synthesize 
a natural-type oligodeoxy ribonucleotide, a phosphorothioate-type 
oligodeoxyribonucleotide or a natural-type RNA, or an 
oligodeoxyribonucleotide or oligoribonucleotide modified at base, 
phosphate or sugar moiety. 

For synthesis, the solid-phase synthesis method such as 
amidite method or thiophosphite method with an automatic 
synthesizer or the liquid-phase synthesis method such as 
triester method can be used. Thus obtained oligonucleotide can 
be purified by reverse phase or ion-exchange-type HPLC or by 
cartridge to prepare antisense oligonucleotide. 

As for the computer program for the above method, we can 
use e.g. BASIC, FORTRAN, or C language or languages derived or 
developed therefrom. 

A flow chart of such a computer program is shown in FIG. 4. 

In FIG. 4, parameters L, D, F and NAS as well as a 
dimension (number of all bases) are given predetermined values 
(step 1). NAS is length of antisense oligonucleotide ranging 
from 10 to 30, preferably 15 to 25. 

Then, data (nucleotide sequence) are read (step 2). 

After the data were read, complementarity is examined. 
The position of each base in a nucleotide sequence beginning at 
a specific-site base (i) is expressed as I ( = i, i + 1, i+2, 
etc.), and the position of each base in a nucleotide sequence 
examined for substantial complementarity to said specific site 
is expressed as J (, = j, j - 1, j ~ 2, etc.). Before the 
substantial complementarity between a region beginning at I = i 
and a region beginning at J = j is examined, the substantial 



complementarity between I = i - 1 and J = j + i (wh ere i - 1^1, 
j + 1£N) is examined (step 3) in order to determine whether 
they are a part of a previously numerically expressed 
substantial complementary region, if a base at I = i - i and a 
base at J = j + i are substantially complementary, the 
substantial complementarity between a base at I = i and a base 
at J = j has already been examined. Therefore, it is not 
necessary to examine the substantial complementarity between 
regions beginning at I = i and J = j. 

If it is judged in step 3 that it is not necessary to 
examine the substantial complementarity (in the case of "yes" in 
step 3 because I = i - 1 and J = j + i are substantially 
complementary), then the substantial complementary between I = i 
- 1 and J = j is examined (this step is repeatedly carried out 
if the answer is "yes"). If I = i - l and J = j + l are not 
substantially complementary, it is then determined whether I = ± 
to i + D - l and J = j to j - D + l are substantially 
complementary (step 4). If they are not substantially 
complementary, the step of the program returns to step 3 and the 
substantial complementarity between I = i - 1 and J = j is 
examined, and if they are not substantially complementary, the 
substantial complementarity between I = i to i + d - 1 and j= 
j -1 to j - d is examined in the same manner as in I = i to i + 
D - 1 and J = j to j - D + l. If I = i to i + D - 1 and J = j 
to j - D + l are substantially complementary, the substantial 
complementarity between I = i + d and J = j - D is examined, and 
if they are substantially complementary, the substantial 
complementarity between I = i + D + l and J = j - D . i (s tep 5) 
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is examined, and this step is repeatedly prosecuted until there 
appear bases which are not substantially complementary. The 
double-stranded chain formed by the substantially complementary 
regions thus obtained is examined for its AG and r (step 6). 
A G is determined using the nearest neighbor parameters (see 
Table 1), and r is expressed as the distance (in terms of number 
of bases) between the nearest sites in the substantially 
complementary chain regions. Then, the ability of the resulting 
complementary regions to form a substantially cbmplementary 
double-stranded chain is calculated by substituting these values 
in the above formula I ( step 7 ) . The value thus obtained is 
assigned to the respective bases (bases at i to i + D - 1 (or i 
to i + D + x) and bases at j to j - D + 1 (or j to j - D - x) ) 
in the substantially complementary region by adding this value 
to their previous values if any (EFB#(I) and EFB#(J)), where x = 

0 or a positive integer, I = i to i + D - loritoi+D+x, 
and J = j to j - D + 1 or j to j - D - x. 

Then, it is judged whether it is allowable to examine the 
substantial complementarity between a nucleotide sequence 
beginning at I = i and a nucleotide sequence beginning at J = j 
- 1 (step 8). If (j - D) - (i + D - 1) is not less than L + 1, 
the answer is "yes", otherwise "no". 

If the answer in step 8 is "no", it is then judged whether 

1 = i + 1 is allowable (step 9). That is, if the sum of number 
i and number D exceeds the number of all bases, then I = i + 1 
is not allowable. If the answer in step 9 is "yes", step 3 is 
repeatedly prosecuted to examine bases at I = (i + 1) to (i + D) 
for their complementarity. If the answer in step 9 is "no", the 



summed values (EFB#(I)) assigned to the respective bases in an 
NAS sequence in the position I (i.e. a region at i to (i + NAS - 
1)) are summed up and assigned to said NAS sequence (step 10). 
The value of AS#(i) is the summed ability of the NAS nucleotide 
sequence (a region of from i to (i + NAS - 1)) to form a 
substantially complementary chain and this value can be used as 
an indication of its effectiveness as antisense oligonucleotide. 

As is evident from the foregoing, a nucleotide sequence 
with a less value of AS#(i) can be expected to serve as 
effective antisense oligonucleotide. 

After step 10 was finished , the result is printed out. 

The effect of the present invention is as follows: 

According to the present invention, a target site for 
antisense oligonucleotide can be predicted reliably for a target 
RNA nucleotide sequence without conducting any experiment, and 
the method of the present invention can be used to prepare 
oligonucleotide useful in biochemistry and molecular biology, 
particularly antisense oligonucleotide useful as therapeutic 
agents, diagnostic agents and agents for research purposes. 

Examples 

The present invention is described in more detail by 
reference to Examples. The present invention, however, is not 
limited to Examples. 
Example 1 

A region (635 bases between the 38- and 672-positions in 
SEQ ID NO:l) containing a nucleotide sequence coding for human- 
derived vascular endothelial growth factor (VEGF121, referred to 



hereinafter as "VEGF" ) in a plasmid was evaluated by a BASIC 
program for the ability of its respective bases to form a 
substantially complementary double-stranded chain. 

In this example, the parameters were set at L = 4, D = 4 , 
F = 6, and NAS = 20 in the flow chart in FIG- 4. 

According to these parameters, the whole nucleotide 
sequence of the above 635-base mRNA was examined for the 
complementarity between a specific region of 4 or more bases and 
another site apart by 5 or more bases from said specific region. 

A specific region in the nucleotide sequence between the 
38- and 672-positions in SEQ ID NO:l was examined for its 
ability to form a substantially complementary double-stranded 
chain with another region. This ability was numerically 
expressed where a higher ability to form a substantially 
complementary double-stranded chain was given a larger value, 
and the value thus obtained was assigned to the corresponding 
bases. The values assigned to each base for its substantial 
complementarity to all other regions were summed up. In 
addition to the base pairs of G and C and of A and U, the base 
pair of G and U was also assumed to be a base pair forming a 
substantially double-stranded chain. 

A higher ability to form a substantially double-stranded 
chain was also given when the base pair has higher Gibbs free 
energy as calculated in the nearest neighbor model (S. M. Freier 
et al., Proc. Natl. Acad. Sci. USA, 83, 9373-9377 (1986)). The 
values determined by Naoki Sugimoto et al. was used as the 
parameter in the nearest neighbor model (Table 1). The highest 
ability to form a substantially double-stranded chain was given 
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when the distance between the nearest bases in substantially 
complementary regions were apart by 5 bases , and a less ability 
was given as the distance increases- Specific sites were set 
such that they were apart by 1 base from each other in the 
sequence, and a sequence beginning at each specific site was 
examined for the ability to form a substantially double-stranded 
chain. The sum of values assigned to each base was determined , 
then the summed values of 20 bases were added together and 
assigned to the base at the lowest-number position in said 20 
bases . 

The logarithms of some values thus obtained were plotted 
against base number (■ in FIG. 5). In FIG. 5, experimental 
results obtained using a cell-free transcription and translation 
system (shown in see WO96/00286) are also shown. A larger 

value is given on the ordinate in this graph when the ability to 
form a substantially double-stranded chain as determined by the 
calculation is higher or when the degree of expression of VEGF 
as determined by the experiment is higher. If there is 
correlation between the two results, their plots must have 
similar patterns. 

As can be seen from FIG. 5, both the plots are roughly 
consistent. That is, the expression of VEGF is significantly 
inhibited by antisense oligonucleotide to the region at the 400- 
to 500-positions, while the ability of this region to form a 
substantially complementary double-stranded chain is low. 
Further, the expression of VEGF is high in the experimental 
results when antisense oligonucleotides beginning at bases 77, 
173, 209, 245, 269, 335, 371 and 533 are used, while the ability 



of nearly all of them to form a substantially complementary 
double-stranded chain is high. 

As for nucleotides with NAS (=20) bases having a logarithm 
of 5.5 or less (on the right ordinate) as the sum of the 
respective values of their bases for the ability to form a 
substantially complementary double-stranded chain, 82% of them 
brought about 10 % or less expression of VEGF, and thus it is 
evident that both the results are in good correlation. 
Therefore, effective antisense oligonucleotide can be easily 
obtained by preparing a sequence complementary to a region with 
a low logarithm (5.5 or less) using a known synthetic method etc. 
Example 2 

For cases where the inhibition of expression of VEGF in 
the cell-free transcription and translation system shown in 
Example 1 was significant (♦ in FIG. 5), the production of VEGF 
was examined similarly except the concentrations of the 
antisense oligonucleotide and RNase H were 1/5 (80 nM) and 1/50 
(0.0092 unit///, 1) of concentrations in Example 1, respectively 
(FIG. 6). A143T herein used is antisense oligonucleotide 
(natural-type oligo-DNA) against a 20-mer beginning at the 143- 
position in SEQ ID NO:l. Similarly, A197T, A227T etc. are 
antisense oligonucleotide (natural-type oligo-DNA) against 20- 
mers beginning at the 197-position and the 227-position etc. in 
SEQ ID NO:l. The results indicated that as shown in FIG. 6, the 
expression of VEGF was inhibited in the presence of A47 3T, A47 9T, 
A485T, A491T, A497T, A503T, and A509T, among which A485T and 
A491T exhibited particularly strong inhibition, indicating that 
their antisense effect was significant. Thus results agreed 
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well with the calculated results in FIG. 5 (the lowest value 
(■) is indicated with A491T to A509T), and the method herein 
proposed can be used to identify a site having antisense effect. 
The antisense oligonucleotide shown in FIG. 6 did not inactivate 
the cell-free transcription and translation system itself, 
because in a similar experiment using a plasmid having a 
nucleotide sequence coding for lucif erase in place of VEGF, the 
expression of lucif erase was not inhibited. Therefore , it could 
be concluded that the inhibitory effect on expression as shown 
in FIG. 6 can be contributed to antisense effect. 
Example 3 

The ability of a nucleotide sequence (including its 5 f - 
upstream and 3 1 -downstream regions ) coding for human-derived 
VEGF to form a substantially complementary double-stranded chain 
was evaluated in the manner as described in Example 1 . The 
nucleotide sequence herein used is a sequence consisting of 1873 
bases prepared by reference to a literature (J. Biol., 266 f 
11947-11954 (1991) and Science 246, 1309-1312 (1989)), and its 
nucleotide sequence is shown in SEQ ID NO: 2. The first base in 
SEQ ID NO:2 (origin of transcription) was assigned as 63- 
position for correspondence to SEQ ID NO:l. Accordingly, the 
origin of translation was 1101-position corresponding to the 
101-position in SEQ ID N0:1. The results indicated that the 7 
nucleotide sequences shown in Table 2 are prospective antisense 
oligonucleotide. They were synthesized by an automatic 
synthesizer using the phosphor oamidite method and examined for 
their antisense effect on human lung cancer-derived A549 cells 
and human f ibroblast-derived HT1080. 
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Table 2 



Antisense 
Oligo- 
nucleotide # 


Nucleotide Sequence of 

Antisense 

Oligonucleotide 


SEQ 
ID NO: 


Inhibitory Effect 
on VEGF Expression 


A549 


HT1080 


U0370T-S 


ACCTCTTTCCTCTTTCTGCT 


4 


+ 


+ + 


U0406T-S 


CTCTCTCTTCCTCGACTTCT 


5 


| 1 


1 1 


U0413T-S 


ACCCCGTCTCTCTCTTCCTC 


6 


+ + 




UQ853T-S 


CTCCTCTTCCTTCTCTTCTT 


•7 

/ 


+ + 


+ + 


U1598T-S 


GTTCTGTATCAGTCTTTCCTG 


8 


+ 


+ 


U1676X-S 


CTTCATTTCAGGTTTCTGGATTAA 


9 


+ 


+ • 


A485T-S 
(1485T-S) 


TCTTTCTTTGGTCTGCATTC 


10 


+ 


+ 



A549 cells were cultured in MEM medium containing 10 % FBS 
in a 48-well plate in a 5 % C0 2 atmosphere at 37 "C. When the 
number of cells reached about 1 to 3X10 5 cells/well, they were 
incubated for 2 hours in a 5 % C0 2 atmosphere at 37 °C in OPTI- 
MEM medium (in the absence of serum) containing 5.25 u g/200 fi 1 
(about 15 ii M) Tfx-50 (Promega) and 2.3 u M antisense 
oligonucleotide. Thereafter , the medium was exchanged with 
fresh DMEM medium containing 10 % FBS, and they were incubated 
for 3 hours in a 5 % C0 2 atmosphere at 37 V, , during which VEGF 
was released into the medium. The amount of VEGF was determined 
by the EL ISA method using an anti-VEGF polyclonal antibody as 
primary antibody and a peroxidase-labeled anti-VEGF polyclonal 
antibody as secondary antibody (WO96/00286 ) . 
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HT1080 cells were cultured in MEM medium containing 10 % 
FBS in a 48-well plate in a 5 % C0 2 atmosphere at 37 *C. When 
the number of cells reached about 4 to 7Xio 4 cells /well, they 
were incubated for 2 hours in a 5 % C0 2 atmosphere at 37 t in 
OPTI-MEM medium (in the absence of serum) containing 2.5 to 3.5 
U g/200 n 1 (about 7 to 10 n M) Tfx-50 (Promega) and 1.6 to 2.3 
H M antisense oligonucleotide. Thereafter , the medium was 
exchanged with fresh DMEM medium containing 10 % FBS, and the 
cells were incubated for 2 hours in a 5 % C0 2 atmosphere at 37 "C, 
during which VEGF was released into the medium. The amount of 
VEGF was determined by the ELISA method using an anti-VEGF 
polyclonal antibody as primary antibody and a peroxidase-labeled 
anti-VEGF polyclonal antibody as secondary antibody (WO96/00286 ) . 

Table 2 shows the results where is given when the 

production of VEGF was significantly inhibited and "++" is given 
when the amount of VEGF produced was 50 % or less as compared 
with the production of VEGF in the presence of the 
phosphorothioate-type oligo-DNA (EA419T-S having the nucleotide 
sequence CTAGACTGTGTGTTCTGGAG (SEQ ID NO:3)) as the control. As 
is evident from the results, every antisense oligonucleotide 
selected by calculation significantly inhibited the production 
of VEGF, and about half of the examined 7 sequences inhibited 
50 % or more expression of VEGF as compared with the control. 
Because a decrease in the number of both the cells was only 30 % 
or less as compared with the case where the phosphorothioate- 
type oligo-DNA was not added, it was confirmed that the 
antisense effect was not caused by toxicity. 
Example 4 . 
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The ability of a luciferase gene to form a substantially 
complementary double- stranded chain was evaluated in the manner 
as described in Example 1. As the gene coding for luciferase, a 
nucleotide sequence consisting of 1150 bases shown in SEQ ID 
NO: 11 was calculated. Here, the first base in SEQ ID NO: 11 was 
assigned as 38-position, and the last base as 1187-position. 5 
sequences (A653T, A960T, A996T, A1405T, and A1446T) which were 
identified by calculation as prospective candidate for antisense 
oligonucleotide and 50 randomly selected sequences (A085T to 
A575T) which were apart by 10 bases from one another were 
examined for their ability to inhibit the expression of 
luciferase in the same cell-free transcription and translation 
system as in Example 1. The results are shown in Table 3. 

Table 3 



NO. 


Designation 


Position of 
Nucleotide 


Expression 
Ratio 


1 


A085T 


85-104 


4% 


2 


A095T 


95-114 


9% 


3 


A105T 


105-124 


11% 


4 


A115T 


115-134 


23% 


5 


A125T 


125-144 


23% 


6 


A135T 


135-154 


6% 


7 


A145T 


145-164 


11% 


8 


A155T 


155-174 


12% 


9 


A165T 


165-184 


3% 


10 


A175T 


175-194 


5% 1 


11 


A185T 


185-204 


14% 


12 


A195T 


195-214 


8% 


13 


A205T 


205-224 


13% 


14 


A215T 


215-234 


7% 


15 


A225T 


225-244 


10% 


16 


A235T 


235-254 


10% 


17 


A245T 


245-264 


10% 



2 9 



18 


A255T 


255-274 


112% 


19 


A265T 


265-284 


101% ! 


20 


A275T 


275-294 


21% 


21 


A285T 


285-304 


8% 


22 


A295T 


295-314 


14% 


23 


A305T 


305-324 


60% 


24 


A315T 


315-334 


10% 


25 


A325T 


325-344 


4% 


26 


A335T 


335-354 


7% 


27 


A345T 


345-364 


33% 


28 


A355T 


355-374 


17% 


29 


A365T 


365-384 


23% 


30 


A375T 


375-394 


13% 


31 


A385T 


385-404 


12% 


32 


A395T 


395-414 


35% 


33 


A405T 


405-424 


1% 


34 


A415T 


415-434 


12% 


35 


A425T 


425-444 


25% 


36 


A435T 


435-454 


12% 


37 


A445T 


445-464 


20% 


38 


A455T 


455-474 


26% 


39 


A465T 


465-484 


8% 


40 


A475T 


475-494 


11% 


41 


A485T 


485-504 


6% 


42 


A495T 


495-514 


3% 


43 


A505T 


505-524 


6% 


44 


A515T 


515-534 


24% 


45 


A525T 


525-544 ! 


12% 


46 


A535T 


535-554 


5% 


47 


A545T 


545-564 


2% 


48 


A555T 


555-574 


5% 


49 


A565T 


565-584 


12% 


50 


A575T 


575-594 


22% 


51 


A653T 


653-672 


1% 


52 


A960T 


960-979 


6% 


53 


A996T 


996-1015 


5% 


54 


A1405T 


1405-1424 


2% 


55 


A1446T 


1446-1465 


33% 
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In Table 3, antisense oligonucleotide numbers, such as 
A653T etc., have the same meaning as in the previous patent 
(WO96/00286) or as described above, and A653T refers to 
antisense oligonucleotide (natural-type oligo-DNA) against a 20- 
mer beginning at the 653-position in SEQ ID NO: 11. The first 
base in SEQ ID NO: 11 was assigned as 38-position, and the last 
base as 1 187 -positions . That is, the nucleotide sequence of 
A653T is CATTATCAGTGCAATTGTTT (SEQ ID NO: 12), Similarly, A960T 
and A990T refer to antisense oligonucleotide (natural type 
oligo-DNA) against 20-mers beginning respectively at the 960- 
and 990-positions in SEQ ID NO: 11. 

As shown in Table 3, the 5 nucleotide sequences predicted 
to be suitable antisense oligonucleotide by calculation results 
demonstrated significant inhibition as compared with the 50 
randomly selected nucleotide sequences. For example, 10 % or 
less expression was achieved by 22 (44 %) of the 50 nucleotide 
sequences randomly selected, whereas the same degree expression 
was achieved by 4 (80 %) of the 5 nucleotide sequences predicted 
to be suitable as antisense oligonucleotide. 
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SEQUENCE LISTING 
SEQ ID NO: 1 
LENGTH: 774 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULAR TYPE: mRNA 
SEQUENCE: 

UUAUGUAUCA UACACAUACG AUUUAGGUGA CACUAUAGAA UACAAGCUUA UGCAUGCGGC 60 
CGCAUCUAGA GGGOCCGGCC CCGGUCGGGC CUCOGAAACC AUGAACUUUC UGCUGUCUUG 120 
GGUGCAUUGG AGCCUUGCCU UGCUGCUCUA CCUCCACCAU GCCAAGUGGU CCCAGGCUGC 180 
ACCCAUGGCA GAAGGAGGAG GGCAGAAUCA UCACGAAGUG GUGAAGUUCA UGGAUGUCUA 240 
UCAGCGCAGC UACUGCCAUC CAAUCGAGAC CCUGGUGGAC AUCUUCCAGG AGUACCCUGA 300 
UGAGAUCGAG UACAUCUUCA AGCCAUCCUG UGUGCCCCUG AUGCGAUGCG GGGGCUGCUG 360 
CAAUGACGAG GGCCUGGAGU GUGUGCCCAC UGAGGAGUCC AACAUCACCA UGCAGAUUAU 420 
GCGGAUCAAA CCUCACCAAG GCCAGCACAU AGGAGAGAUG AGCUUCCUAC AGCACAACAA 480 
AUGUGAAUGC AGACCAAAGA AAGAUAGAGC AAGACAAGAA AAAUGUGACA AGCCGAGGCG 540 
GUGAGCCGGG CAGGAGGAAG GAGCCUCCCU CAGGGUUUCG GGAACCAGAU CCACUAGUUC 600 
UAGAUGCAUG CUCGAGCGGC CGCCAGUGUG AUGGAUAUCU GCAGAAUUCC AGCACACUGG 660 
CCGUUACUAG UGGAUCCGAG CUCCCAAAAA AAAAAAAAAA AAAAAAAAAA AAAAACOGAA 720 
UUAAUUCGUA AUCAUGGUCA UAGCUGUUUC CUGUGUGAAA UUGUUAUCCG CUCA 774 

SEQ ID NO: 2. 
LENGTH: 1873 
TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULAR TYPE: mRNA 
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SEQUENCE : 



U LuUjUnvja\-. 


t tt TTL/^n/^f^ rp_p cnnrsx^nnxin 
U VA3\^o\^ Avj\~ V* VctovaU AvjUU v.. 


PP_A PPTTPPJTP 


GCGCU GSaGGG 


pT TA PP APPA P 

CUAGCAC CAG 


60 


p r^f^T TP T Tp T TP P 

CGCUCUGUCG 


P PA Clf^ PP PAP PPPTTTT APPJTP 

GGAGGCGCAG CGGUUAGGUG 


P APPPPTTPA P 

GACCGGU CAG 


PPP APT TP AP P 

CGGACUCAC C 


GGCCAGGGCG 


120 


p t tp w t tp pt t p 
CUCGGUGCUG 


P A ATTTTTTP ATT A T TT TP AT TT TP AT TP 

GAAUU UGAJJ A UUCAUUGAUC 


P PP PT TT TT TT TA T t 

C GGGU UU UAU 


P PPT TPT TT TPT 1 T T 

CCCUCUUCUU 


TTTTf TTl/^T Tf 5 5k Tk 5k 

UUUUCUUAAA 


180 


P ATTTTTTTTTTTTTTT7 

CAUUUUu UUU 


TTAAAAPfTPTTA T TT TPT TT TT TP T TP P 

UAAAACUGUA UUGUUUC UC G 


TTTTTTTTA ATTTTTTA 

UuUlJAAIJUUA 


T TT TfTT TT T PP T TT T /** 

UUUUUGCUUG 


TYTT/5/5/5/5 5k 

CCAUUCCCCA 


240 


PTTTTPA ATTPPP 

CUUGAAUCGG 


PPPPA /V/TTT TT/^/^/T 1 AP ATTTT 

GCCGACGGCU UGGGGAGAUU 


/^/TTT/5f t »/5TTf T/5 

GCUCuACUuC 


/"t/^y^5k 5k 5kTTy^5k y*5 

CCCAAAUCAC 


UGUGGAUUUU 


300 


/•^ /"™»5k 5k jk /"V** 5k /"* /""I 

GGAAACCAGC 


Tk P>A * Tk PTk 5k Tk -m 5k y"i 

AGAAAGAGGA AAGAGGUAGC 


AAGAGCUCCA 


GAGAGAAGUC 


GAGGAAGAGA 


360 


GAGACGGGGU 


P HP HP TkP Tk PP /^/V , /^/ , / , ^/V»TT 

CAGAGAGAGC GCGCGGGCGU 


GCGAGCAGCG 


5k 5k 5k y^y^y>^5k y^5k y*» 

AAAGCGACAG 


y/*^ "Ik -5Jk «fh y W 5> y**5| 

GGGCAAAGUG 


420 


1* y 1 «f T^^5k ^OttO^ 

AGUGACCUGC 


UUUUGGGGGU GACCGCCGGA 


GCGCGGCGUG 


AGCCCUCCCC 


CUUGGGAUCC 


480 


5k ^/^TT/^H y"5 

CGCAGCUGAC 


y^l 5k T T/*5 y<y^ ^^y^ V t y^ *m y^ y*wy^ 5i y^ * y"* ~& 

CAGUCGCGCU GACGGACAGA 


CAGACAGACA 


CCGCCCCCAG 


CCCCAGCUAC 


540 


CACCUCCUCC 


CCGGCCGGCG GCGGACAGUG 


GACGCGGCGG 


CGAGCCGCGG 


GCAGGGGCCG 


600 


GAGCCCGCGC 


y"^ y^ly*^ ^T^k y^y^ y%y« .xt y"^ y^pr T y*ty<^ Tk y»> y>^ 

CCGGAGGCGG GGUGGAGGGG 


GUCGGGGCUC 


GCGGCGEJCGC 


ACUGAAACUU 


660 


T TT T^l y— *T" T y-Sy^ Tk Ik y-| 

UUCGUCCAAC 


UUCUGGGCUG UUCUCGCUUC 


GGAGGAGCCG 


UGGUCCGCGC 


GGGGGAAGCC 


720 


P Tk P /"VI P5k /"»/"5 y"5 

GAGCCGAGCG 


GAGCCGCGAG AAGUGCXJAGC 


T *y«y« ****** y«y« m/« — i 

UCGGGCCGGG 


AGGAGCCGCA 


GCCGGAGGAG 


780 


P PP y**5k y^/"* 5k /T 

GGGGAGGAGG 


Tk 5k y^ Tk 5k y^5k y^5k 5i y>^y*55k ■* « 5k y^5k y^5 

AAGAAGAGAA GGAAGAGGAG 


AGGGGGCCGC 


AGUGGCGACU 


CGGCGCUCGG 


840 


5k I* y-^ OP PP PP T T 

AAGCCGGGCU 


CAUGGACGGG UGaGGCGGCG 


y^ T t/"^ t Ty^ y^y^i y^ t» 

GUGUGCGCAG 


ACAGUGCUCC 


AGCCGCGCGC 


900 


GCU CC CC AGG 


pp/^ITTPPPPP/^ /5/5y5/5rT/5y5y5y5y^ 

CCCUGGCCCG GGCCUCGGGC 


CGGGGAGGAA 


GAGUAGCUCG 


y^ y^y^ 5k y^y-» y»^y«-l 

CCGAGGCGCC 


y"\ /N 

960 


toAGVkiAGAGC G 


P PP PP PP pp Tk PAP PP PP Ik P P 

GGC CGCC CCA CAGCCCGAGC 


P/"«P Tk/5 5k 

CGGAGAGGGA 


GCGCGAGCCG 


/-^ /V5 /^/*5 y^y*5 y-i 

CGCCGGCCCC 


1020 


GGU CGGGCCU 


PPPAAAPPATT P A Ik PTTTTf T/"5TTJ"* 

CCGAAACCAU GAACUUUCUG 


CUGUCUUGGG 


* T ^ty^ tl T tt T/^ 5k y^ 

UGCAUUGGAG 


CCUUGCCUUG 


1080 


PTTPPTTPTT *rin 

C UGCU CU AC C 


ttppappattpp p a a pttppttpp 

U CC AC CAUGC CAAGUGGUCC 


CAGGCUGCAC 


y^5k Tt/^ r^f~\ Tk P 5k 

CCAUGGCAGA 


Tk y^»y~» 5k y~* 5k y"*< y-« y*^ 

AGGAGGAGGG 


1140 


r* aptaattp attp 
UAiaAAUCAUC 


A PP A A PTTPPTT P A A PT TT TP A T TP 

ACvjAAGU GGu GAAGU UC AU G 


P ATTPTTPITT AT TP 

GAUGUCUAUC 


A PP /""V*5 5k P PT T 5k 

AGCGCAGCuA 


CUGCCAIJCCA 


1200 




TT PPT TP PA PATT PTTTTPP APPA P 

U\^U\jVoALAU CUUCCAGGAG 


t T A P PP T TP AT TP 

U AC CC UCxAU G 


A PA TTPPA PTT 7k 

AGAUC GAGUA 


P Tk T TPT TT TP 7k Tk P 

CAuCUUCAAG 


1260 




T irZT* PP PT TP.7V T T PPP ATT/^PPj^P 


pPPTTppTTpp 5k 

vjoC UGCU GC A 


A T TP APPA PP P 

AUGAC GAGGG 


PPTTPP APTTPTT 

C CU GGAGUGU 


1320 


GUGCCCACUG 


AGGAGUCCAA CAUCACCAUG 


CAGAUUAUGC 


GGAUCAAACC 


UCACCAAGGC 


1380 


CAGCACAUAG 


GAGAGAUGAG CUUCCUACAG 


CACAACAAAU 


GUGAAUGCAG 


ACCAAAGAAA 


1440 


GAUAGAGCAA 


GACAAGAAAA AUGUGACAAG 


CCGAGGCGGU 


GAGCCGGGCA 


GGAGGAAGGA 


1500 


GCCUCCCUCA 


GGGUUUCGGG AACCAGAUCU 


CUCACCAGGA 


AAGACUGAUA 


CAGAACGAUC 


1560 


GAUACAGAAA 


CCACGCUGCC GCCACCACAC 


CAUCACCAUC 


GACAGAACAG 


UCCUUAAUCC 


1620 
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AGAAACCUGA AAUGAAGGAA GAGGAGACUC UGCGCAGAGC ACUUUGGGUC CGGAGGGCGA 1680 
GACUCCGGCG GAAGCAUUCC CGGGCGGGUG ACCCAGCACG GUCCCUCUUG GAAUUGGAUU 1740 
CGCCAUUUUA UUUUUCUUGC UGCUAAAUCA CCGAGCCCGG AAGAUUAGAG AGUUUUAUUU 1800 
CUGGGAUUCC UGUAGACACA CCCACCCACA UACAUACAUU UAUAUAUAUA UAUAUUAUAU 1860 
AUAUAUAAAU UAA 1873 

SEQ ID NO: 3 
LENGTH: 20 
TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE : 

CTAGACTGTG TGTTCTGGAG 20 

SEQ ID NO: 4 
LENGTH: 20 
TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE : 

ACCTCTTTCC TCTTTCTGCT 20 

SEQ ID NO: 5 
LENGTH: 20 
TYPE: nucleic acid 
STRANDEDNESS: single 
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TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE : 

CTCTCTCTTC CTCGACTTCT 

SEQ ID NO: 6 
LENGTH: 20 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

molecular TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE : 

ACCCCGTCTC TCTCTTCCTC 

SEQ ID NO: 7 
LENGTH: 20 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE: 

CTCCTCTTCC TTCTCTTCTT 

SEQ ID NO: 8 
LENGTH : 2 1 
TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 



MOLECULAR TYPE : other nucleic acid (synthetic DNA) 
SEQUENCE : 

GTTCTGTATC AGTCTTTCCT G 

SEQ ID NO: 9 
LENGTH: 24 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE : 

CTTCATTTCA GGTTTCTGGA TTAA 

SEQ ID NO: 10 
LENGTH: 20 
TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
SEQUENCE : 

TCTTTCTTTG GTCTGCATTC 



SEQ ID NO: 11 
LENGTH: 1150 
TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULAR TYPE: mRNA 



SEQUENCE: 

GAAUACAAGC UUAUGCAUGC GGCCGCAUCU AGAGGGCCCG GAUCCAAAUG GAAGACGCCA 60 
AAAACAUAAA GAAAGGCCCG GCGCCAUUCU AUCCUCUAGA GGAUGGAACC GCUGGAGAGC 120 
AACUGCAUAA GGCUAUGAAG AGAUACGCCC UGGUUCCUGG AACAAUUGCU UUUACAGAUG 180 
CACAUAUCGA GGUGAACAUC ACGUACGCGG AAUACUUCGA AAUGUCCGUU CGGUUGGCAG 240 
AAGCUAUGAA ACGAUAUGGG CUGAAUACAA AUCACAGAAU CGUCGUAUGC AGUGAAAACU 300 
CUCUUCAAUU CUUUAUGCCG GUGUUGGGCG CGUUAUUUAU CGGAGUUGCA GUUGCGCCCG 360 
CGAACGACAU UUAUAAUGAA CGUGAAUUGC UCAACAGUAU GAACAUUUCG CAGCCUACCG 420 
UAGUGUUUGU UUCCAAAAAG GGGUUGCAAA AAAUUUUGAA CGUGCAAAAA AAAUUACCAA 480 
UAAUCCAGAA AAUUAUUAUC AUGGAUUCUA AAACGGAUUA CCAGGGAUUU CAGUCGAUGU 540 
ACACGUUCGU CACAUCUCAU CUACCUCCCG GUUUUAAUGA AUACGAUUUU GUACCAGAGU 600 
CCUUUGAUCG UGACAAAACA AUUGCACUGA UAAUGAAUUC CUCUGGAUCU ACUGGGUUAC 660 
CUAAGGGUGU GGCCCUUCCG CAUAGAACUG CCUGCGUCAG AUUCUCGCAU GCCAGAGAUC 720 
CUAUUUUUGG CAAUCAAAUC AUUCCGGAUA CUGCGAUUUU AAGUGUUGUU CCAUUCCAUC 780 
ACGGUUUUGG AAUGUUUACU ACACUCGGAU AUUUGAUAUG UGGAUUUCGA GUCGUCUUAA 840 
UGUAUAGAUU UGAAGAAGAG CUGUUUUUAC GAUCCCUUCA GGAUUACAAA AUUCAAAGUG 900 
CGUUGCUAGU ACCAACCCUA UUUUCAUUCU UCGCCAAAAG CACUCUGAUU GACAAAUACG 960 
AUUUAUCUAA UUUACACGAA AUUGCUUCUG GGGGCGCACC UCUUUCGAAA GAAGUCGGGG 1020 
AAGCGGUUGC AAAACGCUUC CAUCUUCCAG GGAUACGACA AGGAUAUGGG CUCACUGAGA 1080 
CUACAUCAGC UAUUCUGAUU ACACCCGAGG GGGAUGAUAA ACCGGGCGCG GUCGGUAAAG 1140 
UUGUUCCAUU 



1150 



SEQ ID NO: 12 
LENGTH: 20 
TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 

MOLECULAR TYPE: other nucleic acid (synthetic DNA) 
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SEQUENCE : 

CATTATCAGT GCAATTGTTT 



